• Big Data Pipelines on the Computing Continuum: Ecosystem and Use Cases Overview 

      Roman, Dumitru; Nikolov, Nikolay; Soylu, Ahmet; Elvesæter, Brian; Song, Hui; Prodan, Radu; Kimovski, Dragi; Marrella, Andrea; Leotta, Francesco; Matskin, Mihhail; Ledakis, Giannis; Theodosiou, Konstantinos; Simonet-Boulogne, Anthony; Perales, Fernando; Kharlamov, Evgeny; Ulisses, Alexandre; Solberg, Arnor; Ceccarelli, Raffaele (Peer reviewed; Journal article, 2021)
      Organisations possess and continuously generate huge amounts of static and stream data, especially with the proliferation of Internet of Things technologies. Collected but unused data, i.e., Dark Data, mean loss in value ...
    • Big Data Pipelines on the Computing Continuum: Tapping the Dark Data 

      Roman, Dumitru; Prodan, Radu; Nikolov, Nikolay; Soylu, Ahmet; Matskin, Mihhail; Marrella, Andrea; Kimovski, Dragi; Elvesæter, Brian; Simonet-Boulogne, Anthony; Ledakis, Giannis; Song, Hui; Leotta, Francesco; Kharlamov, Evgeny (Peer reviewed; Journal article, 2022)
      The computing continuum enables new opportunities for managing big data pipelines concerning efficient management of heterogeneous and untrustworthy resources. We discuss the big data pipelines lifecycle on the computing ...
    • Big data workflows: Locality-aware orchestration using software containers 

      Corodescu, Andrei-Alin; Nikolov, Nikolay; Khan, Akif Quddus; Soylu, Ahmet; Matskin, Mihhail; Payberah, Amir H.; Roman, Dumitru (Peer reviewed; Journal article, 2021)
      The emergence of the Edge computing paradigm has shifted data processing from centralised infrastructures to heterogeneous and geographically distributed infrastructures. Therefore, data processing solutions must consider ...
    • Conceptualization and scalable execution of big data workflows using domain-specific languages and software containers 

      Nikolov, Nikolay; Dessalk, Yared Dejene; Khan, Akif Quddus; Soylu, Ahmet; Matskin, Mihhail; Payberah, Amir; Roman, Dumitru (Peer reviewed; Journal article, 2021)
      Big Data processing, especially with the increasing proliferation of Internet of Things (IoT) technologies and convergence of IoT, edge and cloud computing technologies, involves handling massive and complex data sets on ...
    • ContrastNER: Contrastive-based Prompt Tuning for Few-shot NER 

      Layegh, Amirhossein; Hossein Payberah, Amir; Soylu, Ahmet; Roman, Dumitru; Matskin, Mihhail (Chapter, 2023)
      Prompt-based language models have produced encouraging results in numerous applications, including Named Entity Recognition (NER) tasks. NER aims to identify entities in a sentence and provide their types. However, the ...
    • DataCloud: Enabling the Big Data Pipelines on the Computing Continuum 

      Roman, Dumitru; Nikolov, Nikolay; Elvesæter, Brian; Soylu, Ahmet; Prodan, Radu; Kimovski, Dragi; Marrella, Andrea; Leotta, Francesco; Benvenuti, Dario; Matskin, Mihhail; Ledakis, Giannis; Simonet-Boulogne, Anthony; Perales, Fernando; Kharlamov, Evgeny; Ulisses, Alexandre; Solberg, Arnor; Ceccarelli, Raffaele (Chapter, 2021)
    • Locality-Aware Workflow Orchestration for Big Data 

      Corodescu, Andrei-Alin; Nikolov, Nikolay; Khan, Akif Quddus; Soylu, Ahmet; Matskin, Mihhail; Payberah, Amir; Roman, Dumitru (Chapter; Peer reviewed, 2021)
      The development of the Edge computing paradigm shifts data processing from centralised infrastructures to heterogeneous and geographically distributed infrastructure. Such a paradigm requires data processing solutions that ...
    • Smart Data Placement for Big Data Pipelines: An Approach based on the Storage-as-a-Service Model 

      Khan, Akif Quddus; Nikolov, Nikolay Vladimirov; Matskin, Mihhail; Prodan, Radu; Song, Hui; Roman, Dumitru; Soylu, Ahmet (Chapter, 2022)
      The development of big data pipelines is a challenging task, especially when data storage is considered as part of the data pipelines. Local storage is expensive, hard to maintain, comes with several challenges (e.g., data ...
    • TranSQL: A Transformer-based Model for Classifying SQL Queries 

      Tahmasebi, Shirin; Hossein Payberah, Amir; Soylu, Ahmet; Roman, Dumitru; Matskin, Mihhail (Chapter, 2022)
      Domain-Specific Languages (DSL) are becoming popular in various fields as they enable domain experts to focus on domain-specific concepts rather than software-specific ones. Many domain experts usually reuse their ...
    • TRANSQLATION: TRANsformer-based SQL RecommendATION 

      Tahmasebi, Shirin; Payberah, Amir H.; Soylu, Ahmet; Roman, Dumitru; Matskin, Mihhail (Chapter, 2023)
      The exponential growth of data production emphasizes the importance of database management systems (DBMS) for managing vast amounts of data. However, the complexity of writing Structured Query Language (SQL) queries requires ...